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DETAILED ACTION 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

1 . Claims 1 -6, 1 6, 27, 28, 37-41 , 43, 44, 59, 60, 62 and 63 are rejected under 35 

U.S.C. 103(a) as being unpatentable over Griffin et al. (U.S. Patent 5,701,390), 

hereinafter referred to as Griffin, in view of Barnwell et al. ("Speech Coding: A computer 

laboratory textbook," 1966, John Wiley & Sons, Inc.), hereinafter referred to as 

Barnwell. 

Regarding claim 1, Griffin discloses a method for the synthesis of MBE-based 
coded speech using regenerated phase information. Griffin's method includes the 
following: 

• dividing the speech model parameters into frames, wherein a frame of speech 
model parameters includes pitch information, voicing information determining the 
voicing state in one or more frequency regions, and spectral information (col. 3, lines 4- 
12; col. 9, lines 28-35); 

• computing a first digital filter using a first frame of speech model parameters, 
wherein the frequency response of the first digital filter corresponds to the spectral 
information in frequency regions where the voicing state equals the selected voicing 
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state (Fig. 2, col. 4, lines 38-65; digital filters are used to synthesize the speech, 
excited by the appropriate input [v/uv]); and col. 13, line 60 through col. 14, line 7); 

• computing a second digital filter using a second frame of speech model 
parameters, wherein the frequency response of the second digital filter corresponds to 
the spectral information in frequency regions where the voicing state equals the 
selected voicing state (Fig. 2, col. 4, lines 38-65; parameters from sequential packets 
are loaded creating different filters, which are excited according to voicing state; and 
col. 13, line 60 through col. 14, line 7, sequential packets can overlap). 

• combining the first signal samples with the second signal samples to produce a 
set of digital speech samples corresponding to the selected voicing state (Fig. 2, 
Synthetic speech produced). 

As stated above, Griffin teaches the use of multiple filters (with spectral and 
voicing information), but Griffin does not specifically teach the following: 

• determining a set of pulse locations; 

• producing a set of first signal samples from the first digital filter and the pulse 
locations; 

• producing a set of second signal samples from the second digital filter and the 
pulse locations; 

However, the examiner contends that these concepts were well known in the art, 
as taught by Barnwell. 

In the same field of endeavor, Barnwell teaches speech coding where a filter is 
"programmed" with coefficients and excited with pulses (pp. 85-89, Fig. 5.2), where the 
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pulses will necessarily have a separation (pitch period — location) and sequential sets of 
samples (from frames or subframes) will produce a signal. 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Griffin by specifically providing the features, 
as taught by Barnwell, because it is well known in the art at the time of invention for the 
purpose of producing synthesized speech at a decoder using low bandwidth 
transmissions (Barnwell, p. 85, Introduction). 

Regarding claim 2, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 1 ). In addition, Griffin teaches "wherein the frequency 
response of the first digital filter and the frequency response of the second digital filter 
are zero in frequency regions where the voicing state does not equal the selected 
voicing state" (col. 13, line 62 through col. 14, line 6). 

Regarding claim 3, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 2). In addition, Griffin teaches "wherein the spectral 
information includes a set of spectral magnitudes representing the speech spectrum at 
integer multiples of a fundamental frequency" (col. 4, lines 55-61). 

Regarding claim 4, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 2). In addition, Griffin teaches "wherein the speech model 
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parameters are generated by decoding a bit stream formed by a speech encoder" (col. 
9, lines 26-30). 

Regarding claim 5, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 2). In addition, Griffin teaches "wherein the voicing 
information determines which frequency regions are voiced and which frequency 
regions are unvoiced" (col. 13, line 60 through col. 14, line 5). 

Regarding claim 6, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 5). In addition, Griffin in view of Barnwell (see rejection of 
claim 1 ) teaches "wherein the selected voicing state is the voiced voicing state and the 
pulse locations are computed such that the time between successive pulse locations is 
determined at least in part from the pitch information" (in particular, Barnwell, Fig. 5.2, 
the pitch period determines the space between the excitation pulses). 

Regarding claim 16, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 2). Barnwell teaches "wherein the selected voicing state is a 
pulsed voicing state" (p. 88, Fig. 5.2, voiced excitation can be generated by a pulse 
generator in support of low bandwidth transmission, see claim 1 rejection). 

Regarding claim 27, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 1 ). In addition, Griffin teaches "wherein the spectral 
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information includes a set of spectral magnitudes representing the speech spectrum at 
integer multiples of a fundamental frequency" (col. 4, lines 55-60). 

Regarding claim 28, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 1 ). In addition, Barnwell teaches "wherein the speech model 
parameters are generated by decoding a bit stream formed by a speech encoder" (col. 
3, lines 4-22). 

Regarding claim 37, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 1 ). In addition, Griffin teaches "wherein the digital speech 
samples corresponding to the selected voicing state are further combined with other 
digital speech samples corresponding to other voicing states" (Fig. 2, col. 13, line 62 
through col. 14, line 7). 

Regarding claim 38, this claim has limitations similar to claim 1 and is rejected 
for the same reasons. 

Regarding claim 39, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 38). In addition, Griffin teaches "wherein the digital speech 
samples for the subframe corresponding to the selected voicing state are further 
combined with digital speech samples for the subframe representing other voicing 
states" (Fig. 2, col. 13, line 62 through col. 14, line 7). 
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Regarding claim 40, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 39). In addition, Griffin teaches "wherein the voicing 
information includes one or more voicing decisions, with each voicing decision 
determining the voicing state of a frequency region in the subframe" (col. 13, line 62 
through col. 14, line 7). 

Regarding claim 41, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 40). In addition, Griffin teaches "wherein each voicing 
decision determines whether a frequency region in the subframe is voiced or unvoiced" 
(col. 13, line 62 through col. 14, line 7). 

Regarding claim 43, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 41 ). In addition, Barnwell teaches "wherein each voicing 
decision further determines whether a frequency region in the subframe is pulsed" (Fig. 
5.2 voicing selected the pulse generator that generates the appropriate frequency 
response when passed through the filter. 

Regarding claim 44, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 41 ). In addition, Griffin in view of Barnwell teach "wherein the 
selected voicing state is the voiced voicing state and the pulse locations depend at 
least in part on the decoded pitch information for the subframe" (Griffin, Fig. 2, 
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decodes information resulting in V k going to "voicing band determination" module; 
Barnwell, Fig. 5.2, pitch and voicing information go to pulse generator). 

Regarding claim 59, this claim has limitations similar to claim 40 and is rejected 
for the same reasons. 

Regarding claim 60, this claim has limitations similar to claim 41 and is rejected 
for the same reasons. 

Regarding claim 62, this claim has limitations similar to claim 43 and is rejected 
for the same reasons. 

Regarding claim 63, this claim has limitations similar to claim 44 and is rejected 
for the same reasons. 

2. Claims 7, 42, 45, 46, 49, 61 , 64, 65 and 68 are rejected under 35 U.S.C. 103(a) 
as being unpatentable over Griffin in view of Barnwell and further in view of well known 
prior art (MPEP 2144.03). 

Regarding claims 7, 42, 45, 61 and 64 Griffin in view of Barnwell teaches 
everything claimed, as applied above (see claim 6, 41, 60, 63, respectively), but Griffin 
in view of Barnwell does not specifically teach "the pulse locations are reinitialized if 
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consecutive frames or subframes are predominately not voiced, and future determined 
pulse locations do not substantially depend on speech model parameters corresponding 
to frames or subframes prior to such reinitialization." However, the examiner takes 
official notice of the fact that reinitialization after a period of non-pulsed operation was 
well known in the art. 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Griffin in view of Barnwell, because voiced 
operation is more accurate of the pulses are synchronized to the beginning of a voiced 
segment. 

Regarding claims 46, 49, 65 and 68 Griffin in view of Barnwell teaches 
everything claimed, as applied above (see claim 45, 43, 63, and 62, respectively), but 
Griffin in view of Barnwell does not specifically teach "the frequency responses of the 
first impulse response and the second impulse response correspond to the decoded 
spectral information in voiced frequency regions and the frequency responses are 
approximately zero in other frequency regions." However, the examiner takes official 
notice of the fact that a pulsed excitation will generate a frequency response and that 
the non-voiced segments will typically have a much lower energy response. 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Griffin in view of Barnwell, because voiced 
operation is more accurate of the pulses are synchronized to the beginning of a voiced 
segment. 
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Allowable Subject Matter 

3. Claims 8-1 5, 1 7-26, 29-36, 47, 48, 50-58, 66, 67 and 69-77 are objected to as 
being dependent upon a rejected base claim, but would be allowable if rewritten in 
independent form including all of the limitations of the base claim and any intervening 
claims. Note the 1 12 2 nd rejections of claims 47, 48, 50-58, 66, 67 and 69-77 has 
precedence. 

Regarding claims 10, 19, 31, 50 and 69, Griffin discloses the synthesis of MBE- 
based coded speech, but Griffin does not teach determining FFT coefficients from the 
decoded model parameters for the first frame in frequency regions where the voicing 
state equals the selected voicing state; processing the FFT coefficients with an inverse 
FFT to compute first time-scaled signal samples; interpolating and resampling the first 
time-scaled signal samples to produce first time-corrected signal samples; and 
multiplying the first time-corrected signal samples by a window function to produce the 
first digital filter. Thus the cited prior art alone or in combination, does not fairly suggest 
or disclose the claimed combination of features. 

Regarding claims 8, 17 and 29, Griffin discloses method for reconstructing the 
spectral envelope and voicing information for each of a plurality of frames, but Griffin 
does not teach that the first digital filter is computed as the product of a periodic signal 
and a pitch-dependent window signal, and the period of the periodic signal is 
determined from the pitch information for the first frame. Thus the cited prior art alone 
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or in combination, does not fairly suggest or disclose the claimed combination of 
features. 



Regarding claims 25, 47 and 66, Griffin discloses method for synthesizing 
speech that includes the use of sinusoidal oscillators determined in part from the from 
the fundamental frequency, but Griffin does not teach that the pulse location 
corresponds to a time offset associated with an impulse in an impulse sequence, the 
first signal samples are computed by convolving the first digital filter with the impulse 
sequence, and the second signal samples are computed by convolving the second 
digital filter with the impulse sequence. 



Response to Arguments 

4. Applicant's arguments filed 2/21/06 have been fully considered but they are not 
persuasive. 



5. Applicant asserts on page 16: 

Recognizing that Griffin does not describe or suggest determining a set of 
pulse locations, producing sets of first and second signal samples using 
the digital filters and the pulse locations, and combining the first and 
second signal samples to produce digital speech samples, the rejection 
asserts that doing so was well known, as evidenced by Barnwell. 
Applicant respectfully disagrees. In particular, the passage of Barnwell 
identified in the rejection (pages 85-89) merely describes well known 
linear predictive coding techniques and in no way describes or suggests 
the recited producing of sets of first and second signal samples using the 
digital filters and the pulse locations, or the recited combining of the first 
and second signal samples to produce digital speech samples. 
Accordingly, for at least these reasons, the rejection of claim 1 and its 
dependent claims should be withdrawn. 
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Griffin teaches the determining of spectral and vocing information for frequency 
bands of a frame (abstract, col. 5, lines 58-62, i.e. the computing of a first and second 
digital filter) where it is noted that determining vocing information necessarily 
determines pulse excitation locations (i.e., fundamental frequency = pitch -> pitch period 
-> filter excitation). Finally, Griffin teaches the combining of the information during 
synthesis (Fig. 2). Barnwell teaches the more specific operations of using voicing 
information along with the spectral information (or filter coefficients) to produce the 
synthesized output (i.e., pulse generator with pitch locations exciting the filter). When 
Barwell's teachings are combined with those of Griffin you get "producing of sets of first 
and second signal samples using the digital filters and the pulse locations", and "the 
recited combining of the first and second signal samples to produce digital speech 
samples." 

Conclusion 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 .136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
'shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
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the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to V. Paul Harper whose telephone number is (571) 272- 
7605. The examiner can normally be reached on M-F. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on (571 ) 272-7843. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



3/20/06 




V. Paul Harper 
Patent Examiner 
Art Unit 2654 



